An Introduction to ULDBs and the Trio System

Authors

  • Omar Benjelloun
  • Anish Das Sarma
  • Chris Hayworth
  • Jennifer Widom
Abstract

We introduce ULDBs: relational databases that add uncertainty and lineage of the data as first-class concepts. The ULDB model underlies the Trio system under development at Stanford. We describe the ULDB model, then present TriQL, our SQL-based query language for ULDBs. TriQL’s semantics over ULDBs is defined both formally and operationally, and TriQL extends SQL with constructs for querying lineage and confidence values. We also briefly describe our initial prototype Trio implementation, which encodes ULDBs in conventional relations and automatically translates TriQL queries into SQL commands over the encoding. We conclude with research directions for ULDBs and the Trio system.
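
As a rough illustration of what encoding uncertain data with lineage in conventional relations could look like, the sketch below models a tuple with several alternative values, a confidence per alternative, and lineage pointers, then flattens them into plain rows. The class names, the row layout, and the witness/car sample data are assumptions made for this example; this is not the Trio prototype's actual encoding.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Alternative:
    values: tuple                     # one possible value for the tuple
    confidence: float                 # confidence attached to this alternative
    lineage: frozenset = frozenset()  # (source_xid, source_alt) pairs it derives from

@dataclass
class XTuple:
    xid: int
    alternatives: list

    def is_maybe(self):
        # Hypothetical convention: if confidences sum to less than 1,
        # the tuple may be absent altogether.
        return sum(a.confidence for a in self.alternatives) < 1.0 - 1e-9

def encode(xtuples):
    """Flatten alternatives into rows: (xid, alt_id, confidence, *values)."""
    rows = []
    for xt in xtuples:
        for alt_id, alt in enumerate(xt.alternatives):
            rows.append((xt.xid, alt_id, alt.confidence) + alt.values)
    return rows

def encode_lineage(xtuples):
    """Flatten lineage into rows: (xid, alt_id, source_xid, source_alt)."""
    rows = []
    for xt in xtuples:
        for alt_id, alt in enumerate(xt.alternatives):
            for src_xid, src_alt in sorted(alt.lineage):
                rows.append((xt.xid, alt_id, src_xid, src_alt))
    return rows

# Hypothetical sample data: who saw which car.
saw = [
    XTuple(1, [Alternative(("Amy", "Honda"), 0.5),
               Alternative(("Amy", "Toyota"), 0.5)]),
    XTuple(2, [Alternative(("Betty", "Mazda"), 0.7)]),
]
# A derived tuple whose only alternative comes from alternative 0 of x-tuple 1.
derived = [XTuple(10, [Alternative(("Honda",), 0.5, frozenset({(1, 0)}))])]

rows = encode(saw)
lineage_rows = encode_lineage(derived)
```

Once flattened this way, both tables are ordinary relations, so a query over the uncertain data can in principle be expressed as SQL over them, which is the general idea the abstract describes.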

Similar resources

Letter from the Editor-in-Chief The Data Engineering Conference

We introduce ULDBs: relational databases that add uncertainty and lineage of the data as first-class concepts. The ULDB model underlies the Trio system under development at Stanford. We describe the ULDB model, then present TriQL, our SQL-based query language for ULDBs. TriQL’s semantics over ULDBs is defined both formally and operationally, and TriQL extends SQL with constructs for querying li...

Data Modifications and Versioning in Trio

This paper presents the first DBMS for uncertain data that incorporates data modifications and a simple versioning system. Our work is in the context of Trio, a project at Stanford for managing data uncertainty and lineage. We establish SQL-based language constructs for data modifications, and an extended data model ULDB that supports these modifications yielding versioned relations. We show th...

Widom Databases with Uncertainty and Lineage

This paper introduces ULDBs, an extension of relational databases with simple yet expressive constructs for representing and manipulating both lineage and uncertainty. Uncertain data and data lineage are two important areas of data management that have been considered extensively in isolation, however many applications require the features in tandem. Fundamentally, lineage enables simple and co...

How Does Strawson Unify Epistemology, Ontology and Logic

Strawson’s conception of analysis as a ‘connective linguistic analysis’ makes it possible for him to achieve an indefinitely large range of ideas or concepts, among them a certain number of fundamental, general and pervasive concepts or concept-types which not only are pre-theoretical or ahistorical, but also together constitute a structural framework only within which logic, ontology and epis...

Managing Uncertain Data a Dissertation Submitted to the Department of Computer Science and the Committee on Graduate Studies of Stanford University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy

The ubiquity of uncertain data in modern-day applications (such as information extraction, data integration, sensor and RFID networks, and scientific experiments) has resulted in a growing need for techniques to deal with such data. This thesis addresses challenges in managing uncertain data in a principled, usable, and scalable fashion. We identify and explore a fundamental tension between usa...

Journal:
  • IEEE Data Eng. Bull.

Volume 29  Issue

Pages  -

Publication date 2006